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1  Summary 

We  have  made  substantial  progress  in  a  number  of  areas  covered  by  the  proposal.  We 
have  shown  how  to  efficiently  reconstruct  order-based  distributions  from  noisy  comparisons 
and  noisy  orders  thus  providing  important  examples  where  the  generating  model  is  not  a 
tree  but  can  still  be  recovered  efficiently.  We  have  made  some  important  discoveries  on 
reconstruction  of  Markov  random  fields,  essentially  showing  that  Markov  Random  Fields 
of  bounded  degrees  are  reconstruct ible  if  the  data  at  all  nodes  is  observable  and  that  the 
problem  is  computationally  intractable  if  some  nodes  are  hidden. 

For  trees,  we  have  studied  a  number  of  questions  concerning  aggregations  of  information 
from  different  sources,  including  mixtures  of  tree  distributions  and  an  analysis  of  gene  trees 
and  the  corresponding  population  tree. 

We  have  continued  studying  optimization  over  networks,  in  particular,  social  networks. 
Our  results  concern  optimization  of  spread  processes,  the  structure  of  Nash  Equilibrium 
in  random  games,  efficient  greedy  learning  in  networks  from  Gaussian  signals  and  the  con¬ 
struction  of  optimal  local  compression  graphs. 

Finally,  we  have  made  considerable  progress  in  studying  local  algorithms  for  calculat¬ 
ing  marginal  distributions  over  networks.  Most  importantly,  we  have  shown  that  MCMC 
methods  are  applicable  to  graphs  that  are  sparse  on  average.  Indeed  for  the  famous  Ising 
model  we  have  found  the  first  tight  conditions  which  ensure  convergence. 
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2  Progress  in  recovering  graph-based  distributions  from  sam¬ 
ples 

In  a  joint  work  with  Mark  Braverman  we  have  shown  how  a  structure  of  a  linear  network, 
i.e.  an  order,  may  be  recovered  efficiently  from  noisy  comparison  between  elements.  An 
extended  abstract  announcing  the  results  is  to  appear  in  proceedings  of  SODA  2008 

Noisy  Sorting  without  resampling  [3,  4].  We  study  problems  of  inferring  order  given 
noisy  information.  In  these  problems  there  is  an  unknown  order  (permutation)  n  on  n 
elements  denoted  by  1, . . .  ,n.  We  assume  that  information  is  generated  in  a  way  correlated 
with  7r .  The  goal  is  to  find  a  maximum  likelihood  n*  given  the  information  observed.  We 
consider  two  different  types  of  observations:  noisy  comparisons  and  noisy  orders. 

•  Noisy  Orders  (also  called  the  Mallow's  model).  Given  the  original  permutation  7r,  the 

probability  of  a  permutation  a  being  generated  is  proportional  to  .  In  other 

words,  the  probability  is  inverse  exponential  in  the  Kemeny  distance  of  7r  from  cr, 
which  is  the  number  of  pairs  ordered  in  7r  differently  from  cr: 

dK{*,o)  =  #{(i,  j)  :  *(i)  <  *(j)  and  a(i)  >  <r(j)}. 

We  assume  that  we  are  given  U\ , . . . ,  ar  that  are  generated  independently  conditioned 

On  7T . 

•  Noisy  Comparisons.  The  input  is  the  status  of  Q)  queries  of  the  form  q(i>  j),  for  i  <  j, 
where  q(i>j)  =  +  (  — )  with  probability  1/2+  (  if  7r(i)  >  7r(j)(7r(i)  <  7r(j>))  for  all  pairs' 
i  7^  jy  where  (>  0  is  a  constant.  It  is  assumed  that  the  errors  are  independent.  More 
generally,  the  input  may  be  any  collection  of  independent  biased  signals  on  the  order 
relationship  between  pairs  of  elements. 


In  our  results  we  present  polynomial  time  algorithms  for  solving  both  problems  with 
high  probability.  For  noisy  orders  the  running  time  of  the  algorithm  is  n1+°((^r)  \  and  for 
noisy  comparisons  the  algorithm  runs  in  time  Both  algorithms  have  O(nlogn) 

query  complexity  (with  the  constant  depending  on  A,  /?  and  r). 

As  part  of  our  proof  we  show  that  for  both  models  the  maximum  likelihood  solution  7r* 
is  close  to  the  original  permutation  7r.  More  formally,  with  high  probability  it  holds  that 

T  K00  _  7r*(i)l  =  ©(«).  max  |7r(i)  -  7r*(i)|  =  9(log  n). 
i 

Our  results  are  of  interest  in  applications  to  ranking,  such  as  ranking  in  sports,  or 
ranking  of  search  items  based  on  comparisons  by  experts. 

In  a  joint  work  with  Matsen  and  Steel  we  analyze  when  can  a  mixture  of  tree-based 
distributions  can  be  recovered  from  samples  of  the  mixture 

Mixed-up  trees:  the  structure  of  phylogenetic  mixtures  [10] .  In  this  paper  we 
apply  new  geometric  and  combinatorial  approaches  to  the  study  of  phylogenetic  mixtures. 
The  focus  of  the  geometric  approach  is  to  describe  the  geometry  of  phylogenetic  mixtures 
for  the  two  state  random  cluster  model,  which  is  a  generalization  of  the  two  state  symmetric 
(CFN)  model.  In  particular,  we  show  that  the  set  of  mixture  distributions  forms  a  convex 
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polytope  and  we  calculate  its  dimension;  corollaries  include  a  simple  criterion  for  when  a 
mixture  of  branch  lengths  on  the  star  tree  can  mimic  the  probability  distribution  on  splits 
of  a  resolved  quartet  tree  Furthermore,  by  computing  volumes  of  polytopes  we  can  clarify 
how  “common”  non-identifiable  mixtures  are  under  the  CFN  model.  We  also  present  a  new 
combinatorial  result  which  extends  any  identifiability  result  for  a  specific  pair  of  trees  of 
size  six  to  arbitrary  pairs  of  trees.  Next  we  present  a  positive  result  showing  identifiabil¬ 
ity  of  rates- across-sites  models.  Finally,  we  answer  a  question  raised  in  a  previous  paper 
concerning  mixed  branch  repulsion  on  trees  larger  than  quartet  trees  under  the  CFN  model. 

In  another  phylogenetic  related  project  with  Sebastien  Roch,  we  show  how  to  integrate 
information  from  different  gene  trees  in  order  to  recover  the  specie  tree. 

Incomplete  Lineage  Sorting:  Consistent  Phylogeny  Estimation  From  Multiple 
Loci  [13].  We  introduce  a  simple  algorithm  for  reconstructing  phylogenies  from  multiple 
gene  trees  in  the  presence  of  incomplete  lineage  sorting,  that  is,  when  the  topology  of  the 
gene  trees  may  differ  from  that  of  the  species  tree.  We  show  that  our  technique  is  statisti¬ 
cally  consistent  under  standard  stochastic  assumptions,  that  is,  it  returns  the  correct  tree 
given  sufficiently  many  unlinked  loci.  We  also  show  that  it  can  tolerate  moderate  estimation 
errors. 

We  have  shown  how  Markov  Random  Fields  can  be  reconstructed  from  observations  for 
bounded  degree  models. 

Reconstruction  of  Markov  Random  Fields  from  Samples:  Some  Observations 
and  Algorithms  [5]  Markov  random  fields  are  used  to  model  high  dimensional  distribu¬ 
tions  in  a  number  of  applied  areas.  Much  recent  interest  has  been  devoted  to  the  reconstruc¬ 
tion  of  the  dependency  structure  from  independent  samples  from  the  Markov  random  fields. 
We  analyze  a  simple  algorithm  for  reconstructing  the  underlying  graph  defining  a  Markov 
random  field  on  n  nodes  and  maximum  degree  d  given  observations.  We  show  that  under 
mild  non-degeneracy  conditions  it  reconstructs  the  generating  graph  with  high  probability 
using  0(dlogn)  samples  which  is  optimal  up  to  a  multiplicative  constant.  Our  results  seem 
to  be  the  first  results  for  general  models  that  guarantee  that  the  generating  model  is  re¬ 
constructed.  Furthermore,  we  provide  explicit  0(nd+2  logn)  running  time  bound.  In  cases 
where  the  measure  on  the  graph  has  correlation  decay,  the  running  time  is  0(n 2  logn)  for  all 
fixed  d.  We  also  discuss  the  effect  of  observing  noisy  samples  and  show  that  as  long  as  the 
noise  level  is  low,  our  algorithm  is  effective.  On  the  other  hand,  we  construct  an  example 
where  large  noise  implies  non-identifiability  even  for  generic  noise  and  interactions.  Finally, 
we  briefly  show  that  in  some  cases,  models  with  hidden  nodes  can  also  be  recovered. 

In  contrast  when  some  of  the  nodes  are  hidden  the  problem  is  computationally  hard. 

The  complexity  of  distinguishing  Markov  Random  Fields  [2]  Markov  random 
fields  are  often  used  to  model  high  dimensional  distributions  in  a  number  of  applied  areas. 
They  provide  a  compact  representation  of  the  distribution  that  is  polynomial  in  the  size 
of  the  dependency  graph  and  at  most  exponential  in  the  size  of  the  maximal  clique  in  the 
graph. 

A  number  of  recent  papers  have  studied  the  problem  of  reconstructing  a  dependency 
graph  of  bounded  degree  from  independent  samples  from  the  Markov  random  field.  These 
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results  require  observing  samples  of  the  distribution  at  all  nodes  of  the  graph.  It  was 
heuristically  recognized  that  the  problem  of  reconstructing  the  model  where  there  are  hidden 
variables  (some  of  the  variables  are  not  observed)  is  much  harder. 

Here  we  prove  that  the  problem  of  reconstructing  bounded-degree  models  with  hidden 
nodes  is  hard.  More  specifically  we  show  that 

•  Unless  NP  =  RP  it  is  impossible  to  decide  in  random  polynomial  time  if  two  models 
generate  distributions  whose  total  variation  distance  is  at  most  1/3  or  at  least  2/3. 

•  Given  two  generating  models  whose  total  variation  is  promised  to  be  at  least  1/3,  and 
an  oracle  access  to  a  sampler  that  generates  independent  samples  from  one  of  the  two 
models,  it  is  impossible  to  decide  in  randomized  polynomial  time  which  of  the  two 
samples  is  consistent  with  the  model  unless  NP  =  RP. 

These  two  results  are  tight  as  the  two  problems  are  solvable  in  polynomial  time  if  NP  =  RP 
For  the  second  problem  of  deciding  which  of  two  models  generates  a  distribution,  we  further 
consider  the  case  where  the  two  distributions  are  efficiently  samplable  and  show  that  in  this 
case  if  it  has  an  efficient  polynomial-time  algorithm  then  every  problem  that  has  a  zero- 
knowledge  proofs  is  decidable  in  random  polynomial  time  ( BPP ). 

3  Progress  in  Optimization  over  networks 

We  have  published  the  following  results  on  optimization  over  social  networks  with  Sebastien 
Roch. 

On  the  Submodularity  of  Influence  in  Social  Networks  [12].  Social  networks  are 
often  represented  as  directed  graphs  where  the  nodes  are  individuals  and  the  edges  indicate 
a  form  of  social  relationship.  A  simple  way  to  model  the  diffusion  of  ideas,  innovative 
behavior,  or  “word-of-mouth”  effects  on  such  a  graph  is  to  consider  an  increasing  process 
of  “infected”  (or  active)  nodes:  each  node  becomes  infected  once  an  activation  function 
of  the  set  of  its  infected  neighbors  crosses  a  certain  threshold  value.  Such  a  model  was 
introduced  by  Kempe,  Kleinberg,  and  Tardos  (KKT)  in  [8,  9]  where  the  authors  also  impose 
several  natural  assumptions:  the  threshold  values  are  random  and  the  activation  functions 
are  monotone  and  submodular.  The  monotonicity  condition  indicates  that  a  node  is  more 
likely  to  become  active  if  more  of  its  neighbors  are  active,  while  the  submodularity  condition 
indicates  that  the  marginal  effect  of  each  neighbor  is  decreasing  when  the  set  of  active 
neighbors  increases. 

For  an  initial  set  of  active  nodes  5,  let  a(S)  denote  the  expected  number  of  active  nodes 
at  termination.  Here  we  prove  a  conjecture  of  KKT:  we  show  that  the  function  <7(5)  is 
submodular  under  the  assumptions  above.  We  prove  the  same  result  for  the  expected  value 
of  any  monotone,  submodular  function  of  the  set  of  active  nodes  at  termination.  Roughly, 
our  results  demonstrate  that  “local”  submodularity  is  preserved  “globally”  under  this  dif¬ 
fusion  process.  This  is  of  natural  computational  interest,  as  many  optimization  problems 
have  good  approximation  algorithms  for  submodular  functions. 

Another  aspects  of  aggregating  information  from  different  sources  in  social  networks  is 
discussed  in  a  recent  work  with  G.  Schoenebeck: 
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Reaching  Consensus  on  Social  Networks  [14]  In  this  work  we  study  consensus  algo¬ 
rithms  on  social  networks.  Research  in  sociology  studies  the  effectiveness  of  social  networks 
in  achieving  computational  tasks.  Typically  the  agents  who  are  supposed  to  achieve  a  task 
are  unaware  of  the  underlying  social  network  except  their  immediate  friends.  They  have 
limited  memory,  communication,  and  coordination.  These  limitations  result  in  computa¬ 
tional  obstacles  in  achieving  otherwise  trivial  computational  problems.  One  of  the  simplest 
problems  studied  in  the  social  sciences  involves  reaching  a  consensus  among  players  between 
two  alternatives  which  are  otherwise  indistinguishable.  In  our  paper  we  formalize  the  com¬ 
putational  model  of  social  networks.  We  then  analyze  the  consensus  problem  as  well  as  the 
problem  of  reaching  a  consensus  which  is  identical  to  the  majority  of  the  original  signals. 
In  both  models  we  seek  to  minimize  the  time  it  takes  players  to  reach  a  consensus. 

A  different  kind  of  problem  involving  optimization  over  social  network  is  studied  in  a 
joint  work  with  Costis  Daskalakis  and  Alex  Dimakis,  two  students  from  U.C.  Berkeley.  In 
this  project  we  study  the  effect  of  the  connectivity  of  a  network  in  the  context  of  random 
graphical  games,  where  the  main  question  is  the  existence  of  mutually  optimal  strategies, 
i.e.,  Nash  Equilibrium 

Connectivity  and  Equilibrium  in  Random  Games  [7].  We  study  how  the  structure 
of  the  interaction  graph  affects  the  Nash  equilibria  of  the  resulting  game  In  particular,  for 
a  fixed  interaction  graph,  we  are  interested  if  there  exist  Nash  equilibria  which  arise  when 
random  utility  tables  are  assigned  to  the  players.  We  provide  conditions  for  the  structure 
of  the  graph  under  which  equilibria  are  likely  to  exist  and  complementary  conditions  which 
make  the  existence  of  equilibria  highly  unlikely.  Our  results  have  immediate  implications  for 
many  deterministic  graphs  and  generalize  known  results  for  games  on  the  complete  graph. 
In  particular,  our  results  imply  that  the  probability  that  bounded  degree  graphs  have  Nash 
equilibria  is  exponentially  small  in  the  size  of  the  graph  and  yield  a  simple  algorithm  that 
finds  small  non-existence  certificates  for  a  large  family  of  graphs.  In  order  to  obtained 
a  refined  characterization  of  the  degree  of  connectivity  associated  with  the  existence  of 
equilibria,  we  study  the  model  in  the  random  graph  setting.  In  particular,  we  look  at  the 
case  where  the  interaction  graph  is  drawn  from  the  Erdos- Renyi,  G(n,p),  where  each  edge 
is  present  independently  with  probability  p.  For  this  model  we  establish  a  double  phase 
transition  for  the  existence  of  pure  Nash  equilibria  as  a  function  of  the  average  degree  pn 
consistent  with  the  non-monotone  behavior  of  the  model  We  show  that  when  the  average 
degree  satisfies  np  >  (2  +  H(l))  logn,  the  number  of  pure  Nash  equilibria  follows  a  Poisson 
distribution  with  parameter  1.  When  1/n  <<  np  <  (0.5  —  0(1)) logn  pure  Nash  equilibria 
fail  to  exist  with  high  probability.  Finally,  when  np  <<  1/n  a  pure  Nash  equilibrium  exists 
with  high  probability. 

As  part  of  our  study  we  made  an  important  progress  in  understanding  how  non-linearity 
coupled  with  sparsity  can  lead  to  optimal  compression. 

Smooth  compression,  Gallager  bound  and  Nonlinear  sparse-graph  codes  [11] 
A  data  compression  scheme  is  defined  to  be  smooth  if  its  image  (the  codeword)  depends 
gracefully  on  the  source  (the  data).  Smoothness  is  a  desirable  property  in  many  practical 
contexts,  and  widely  used  source  coding  schemes  lack  of  it. 

We  introduce  a  family  of  smooth  source  codes  based  on  sparse  graph  constructions,  and 
prove  them  to  achieve  the  (information  theoretic)  optimal  compression  rate  for  a  dense  set 
of  iid  sources.  As  a  byproduct,  we  show  how  Gallager  bound  on  sparsity  can  be  overcome 
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using  non-linear  function  nodes. 


4  Progress  in  Message  Passing  and  MCMC  algorithms 

In  a  joint  work  with  my  student  Allan  Sly  we  considered  the  convergence  rate  of  MCMC 
Gibbs  samplers  for  sampling  graph  distributions  for  graphs  that  are  sparse  on  average.  Our 
results  are  the  first  to  establish  rapid  convergence  for  random  graphs.  The  results  are  in 
two  papers,  the  first  of  which  was  accepted  as  an  extended  abstract  to  SODA  2008,  while 
the  second  one  has  been  submitted. 

Rapid  Mixing  of  Gibbs  Sampling  on  Graphs  that  are  Sparse  on  Average  [15, 
16].  Gibbs  sampling  also  known  as  Glauber  dynamics  is  a  popular  technique  for  sampling 
high  dimensional  distributions  defined  on  graphs.  Of  special  interest  is  the  behavior  of 
Gibbs  sampling  on  the  Erdos-R4nyi  random  graph  G(n,d/n),  where  each  edge  is  chosen 
independently  with  probability  d/n  and  d  is  fixed.  While  the  average  degree  in  G(n,d/n) 
is  d(  1  —  o(l)),  it  contains  many  nodes  of  degree  of  order  log  n/ log  log  n. 

The  existence  of  nodes  of  almost  logarithmic  degrees  implies  that  for  many  natural  distri¬ 
butions  defined  on  G(n ,  p)  such  as  uniform  coloring  (with  a  constant  number  of  colors)  or  the 
Ising  model  at  any  fixed  inverse  temperature  0,  the  mixing  time  of  Gibbs  sampling  is  at  least 
+n(i  /  log  log  n)  Recau  that  the  Ising  model  with  inverse  temperature  0  defined  on  a  graph 
G  =  (V,  E)  is  the  distribution  over  {±} v  given  by  P(a)  =  ^  exp(0  o  (v)ff(ti)). 

High  degree  nodes  pose  a  technical  challenge  in  proving  polynomial  time  mixing  of  the  dy¬ 
namics  for  many  models  including  the  Ising  model  and  coloring.  Almost  all  known  sufficient 
conditions  in  terms  of  0  or  number  of  colors  needed  for  rapid  mixing  of  Gibbs  samplers  are 
stated  in  terms  of  the  maximum  degree  of  the  underlying  graph. 

In  this  work  we  show  that  for  every  d  <  oo  and  the  Ising  model  defined  on  G(n,d/n)y 
there  exists  a  0d  >  0,  such  that  for  all  0  <  0d  with  probability  going  to  1  as  n  — *  oo, 
the  mixing  time  of  the  dynamics  on  G(n,d/n)  is  polynomial  in  n.  Our  results  are  the  first 
polynomial  time  mixing  results  proven  for  a  natural  model  on  G(n,d/n)  for  d  >  T  where 
the  parameters  of  the  model  do  not  depend  on  n.  They  also  provide  a  rare  example  where 
one  can  prove  a  polynomial  time  mixing  of  Gibbs  sampler  in  a  situation  where  the  actual 
mixing  time  is  slower  than  npolylog(n).  Our  proof  exploits  in  novel  ways  the  local  treelike 
structure  of  Erdos- Rinyi  random  graphs,  comparison  and  block  dynamics  arguments  and  a 
recent  result  of  Weitz. 

Our  results  extend  to  much  more  general  families  of  graphs  which  are  sparse  in  some 
average  sense  and  to  much  more  general  interactions.  In  particular,  they  apply  to  any  graph 
for  which  every  vertex  v  of  the  graph  has  a  neighborhood  N(v)  of  radius  O(logn)  in  which 
the  induced  sub-graph  is  a  tree  union  at  most  O(logn)  edges  and  where  for  each  simple 
path  in  N(v)  the  sum  of  the  vertex  degrees  along  the  path  is  O(logn)  .  Moreover,  our  result 
apply  also  in  the  case  of  arbitrary  external  fields  and  provide  the  first  FPRAS  for  sampling 
the  Ising  distribution  in  this  case.  We  finally  present  a  non  Markov  Chain  algorithm  for 
sampling  the  distribution  which  is  effective  for  a  wider  range  of  parameters.  In  particular, 
for  G(n ,  d/n)  it  applies  for  all  external  fields  and  0  <  0d ,  where  dtanh(^)  =  1  is  the  critical 
point  for  decay  of  correlation  for  the  Ising  model  on  G(n,d/n). 

Gibbs  Rapidly  Samples  Colorings  of  G(n,d/n)  [18].  Gibbs  sampling  also  known 
as  Glauber  dynamics  is  a  popular  technique  for  sampling  high  dimensional  distributions 
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defined  on  graphs.  Of  special  interest  is  the  behavior  of  Gibbs  sampling  on  the  Erdos-R4nyi 
random  graph  G(n,  d/n),  where  each  edge  is  chosen  independently  with  probability  d/n  and 
d  is  fixed.  While  the  average  degree  in  G(n, d/n)  is  d(  1  —  o(  1)),  it  contains  many  nodes  of 
degree  of  order  log  n/  log  log  n. 

The  existence  of  nodes  of  almost  logarithmic  degrees  implies  that  for  many  natural 
distributions  defined  on  G(n,p)  such  as  uniform  coloring  (with  a  constant  number  of  colors) 
or  the  Ising  model  at  any  fixed  inverse  temperature  /?,  the  mixing  time  of  Gibbs  sampling  is 
at  least  n1+n(1/loglogn).  High  degree  nodes  pose  a  technical  challenge  in  proving  polynomial 
time  mixing  of  the  dynamics  for  many  models  including  coloring.  Almost  all  known  sufficient 
conditions  in  terms  of  number  of  colors  needed  for  rapid  mixing  of  Gibbs  samplers  are  stated 
in  terms  of  the  maximum  degree  of  the  underlying  graph. 

In  this  work  consider  sampling  ^-colorings  and  show  that  for  every  d  <  oc  there  exists 
q(d)  <  oo  such  that  for  all  q  >  q(d)  the  mixing  time  of  Gibbs  sampling  on  G(n,d/n)  is 
polynomial  in  n  with  high  probability.  Our  results  are  the  first  polynomial  time  mixing 
results  proven  for  the  coloring  model*  on  G(n,d/n)  for  d  >  1  where  the  number  of  colors 
does  not  depend  on  n.  They  also  provide  a  rare  example  where  one  can  prove  a  polynomial 
time  mixing  of  Gibbs  sampler  in  a  situation  where  the  actual  mixing  time  is  slower  than 
npolylog(n).  In  previous  work  we  have  shown  that  similar  results  hold  for  the  ferromag¬ 
netic  Ising  model.  However,  the  proof  for  the  Ising  model  crucially  relied  on  monotonicity 
arguments  and  the  “Weitz  tree’1  both  of  which  have  no  counterparts  in  the  coloring  setting 
Our  proof  presented  here  exploits  in  novel  ways  the  local  treelike  structure  of  Erdos- R^nyi 
random  graphs,  block  dynamics,  spatial  decay  properties  and  coupling  arguments. 

Our  results  give  first  FPRAS  to  sample  coloring  on  G(n,d/n)  with  a  constant  number 
of  colors.  They  extend  to  much  more  general  families  of  graphs  which  are  sparse  in  some 
average  sense  and  to  much  more  general  interactions.  In  particular,  they  apply  to  any  graph 
for  which  there  exists  an  a  >  0  such  that  every  vertex  v  of  the  graph  has  a  neighborhood 
N(v)  of  radius  O(logn)  in  which  the  induced  sub-graph  is  a  tree  union  at  most  0(1)  edges 
and  where  each  simple  path  T  of  length  O(logn)  satisfies  Euer  T.v*u  Qd(u,v)  =  O(logn). 
The  results  also  generalize  to  the  hard-core  model  and  other  models  where  there  are  both 
hard  and  soft  constraints. 

The  two  results  above  are  complemented  by  the  following  breakthrough  result  where 
the  exact  threshold  for  mixing  of  Gibbs  samplers  for  Ising  model  is  found. 

Exact  Thresholds  for  Ising-Gibbs  Samplers  on  General  Graphs  [17]  We  establish 
tight  results  for  rapid  mixing  of  Gibbs  Samplers  for  the  Ferromagnetic  Ising  model  on  general 
graphs.  We  show  that  if 

(d  —  1)  tanh  0  <  1, 

then  there  exists  a  constant  C  such  that  the  discrete  time  mixing  time  of  Gibbs  Samplers 
for  the  Ferromagnetic  Ising  model  on  any  graph  of  n  vertices  and  maximal  degree  d,  where 
all  interactions  are  bounded  by  0 ,  and  arbitrary  external  fields  is  bounded  by  Cn  log  n. 
Moreover,  the  spectral  gap  is  uniformly  bounded  away  from  0  for  all  such  graphs  as  well  as 
for  infinite  graphs  of  maximal  degree  d. 

We  further  show  the  when  dtanh/3  <  1,  with  high  probability  over  the  Erdos- R^nyi 
random  graph  G(n,d/n),  it  holds  that  the  mixing  time  of  Gibbs  Samplers  is 

n1+e(Erfcr;). 
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Both  result  are  tight  as  it  is  known  that  the  mixing  time  for  random  regular  and  Erdos- 
R4nyi  random  graphs  is,  with  high  probability,  exponential  in  n  when  (d  —  l)tanh/3  >  1, 
and  dtanh/3  >  1,  respectively.  To  our  knowledge  our  results  give  the  first  tight  sufficient 
conditions  for  rapid  mixing  of  spin  systems  on  general  graphs.  Moreover,  our  results  are 
the  first  rigorous  results  establishing  exact  thresholds  for  dynamics  on  random  graphs  in 
terms  of  spatial  thresholds  on  trees. 

We  also  study  message  passing  algorithm:  In  a  joint  work  with  Coja-Oghlan  and  Vi- 
lenchik  we  show  how  spectral  techniques  explain  the  success  of  Belief  Propagation  for  solving 
coloring  problems. 

A  Spectral  Approach  to  Analyzing  Belief  Propagation  for  3-Coloring  [6].  Con¬ 
tributing  to  the  rigorous  understanding  of  BP,  in  this  paper  we  relate  the  convergence  of 
BP  to  spectral  properties  of  the  graph.  This  encompasses  a  result  for  random  graphs  with 
a  “planted”  solution;  thus,  we  obtain  the  first  rigorous  result  on  BP  for  graph  coloring  in 
the  case  of  a  complex  graphical  structure  (as  opposed  to  trees).  In  particular,  the  analysis 
shows  how  Belief  Propagation  breaks  the  symmetry  between  the  3!  possible  permutations 
of  the  color  classes. 

The  next  problem  we  study  is  the  power  of  greedy  algorithms  in  inferring  random 
variables  on  networks. 

Iterative  Maximum  Likelihood  on  Networks  [19]  We  consider  n  agents  located  on 
the  vertices  of  a  connected  graph.  Each  agent  v  receives  a  signal  Av(0)  ~  N(/x,  1)  where  /x 
is  an  unknown  quantity.  A  natural  iterative  way  of  estimating  /x  is  to  perform  the  following 
procedure.  At  iteration  t- hi  let  Av(£+1)  be  the  average  of  X„(*)  and  of  Xu,(*)  among  all  the 
neighbors  w  of  v.  It  is  well  known  that  this  procedure  converges  to  X(oo)  =  ^|2£|~1  v 

where  dv  is  the  degree  of  v. 

In  this  paper  we  consider  a  variant  of  simple  iterative  averaging,  which  models  “greedy” 
behavior  of  the  agents.  At  iteration  t,  each  agent  v  declares  the  value  of  its  estimator  Xv(t) 
to  all  of  its  neighbors.  Then,  it  updates  Xv(t  +  1)  by  taking  the  maximum  likelihood  (or 
minimum  variance)  estimator  of  /x,  given  Xv(t)  and  Xu ,(t)  for  all  neighbors  w  of  v,  and  the 
structure  of  the  graph. 

We  give  an  explicit  efficient  procedure  for  calculating  Xv(t),  study  the  convergence  of 
the  process  as  t  — >  oo  and  show  that  if  the  limit  exists  then  Xt,(oo)  =  Au,(oo)  for  all  v 
and  xv.  For  graphs  that  are  symmetric  under  actions  of  transitive  groups,  we  show  that  the 
process  is  efficient.  Finally,  we  show  that  the  greedy  process  is  in  some  cases  more  efficient 
than  simple  averaging,  while  in  other  cases  the  converse  is  true,  so  that,  in  this  model, 
“greed”  of  the  individual  agents  may  or  may  not  have  an  adverse  affect  on  the  outcome. 

The  model  discussed  here  may  be  viewed  as  the  Maximum-Likelihood  version  of  models 
studied  in  Bayesian  Economics.  The  ML  variant  is  more  accessible  and  allows  in  particular 
to  show  the  significance  of  symmetry  in  the  efficiency  of  estimators  using  networks  of  agents. 

4.1  Progress  on  Learning  from  Multiple  Sources 

We  finally  briefly  note  work  with  J.  Arpe  on  learning  from  multiple  sources: 

Application  of  a  Generalization  of  Russo’s  Formula  to  Learning  from  Multiple 
Random  Oracles  [1]  In  this  paper  we  study  the  problem  of  learning  low  complexity 
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functions  called  k-juntas  given  access  to  examples  drawn  from  a  number  of  different  product 
distributions.  Thus  we  wish  to  learn  a  function  /  :  { — 1,1  }n  — >  {  —  1,1}  that  depends  on 
k  (unknown)  coordinates.  While  the  best  known  algorithms  for  the  general  problem  of 
learning  a  A; -junta  require  running  time  which  exceeds  nk ,  we  show  that  given  access  to  k 
different  product  distributions  with  biases  separated  by  e  >  0  ,  the  functions  may  be  learned 
in  time  which  is  polynomial  in  n  and  k.  Our  techniques  involve  novel  results  in  Fourier 
analysis  relating  Fourier  expansions  with  respect  to  different  biases  and  a  generalization  of 
Russo’s  formula 
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